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BANDWIDTH SAVINGS AND QoS IMPROVEMENT FOR WWW SITES 
BY CACHING STATIC AND DYNAMIC CONTENT ON A DISTRIBUTED NETWORK 
OF CACHES 

S TECHNICAL FIELD OF THE INVENTION 

The invention relates to the storage and transport of information on an Internet and, more 
particularly, to caching Web-page content on the Internet. 

]o This Is d wimmiwiion-m-part of ccflnmonry-owned, copending U.S. Provisional Patent 
Application No. 60/21 8,559, filed 17 Jury 2000. 

BACKG RO UND OF THE I NVENTION 

When two or more computers are connected so that they can share resources, this is referred 
15 to as a "network". When two or more networks are connected together, ihw is referred lo aa 
an "internet" Qovtcr case ?). The "Internet" (upper cose T) is a a global, internet, which is a 
plurality of intcr-connec(cd networks, all using o common protocol Imuwn as TCP/IP 
(Transmission Control Proiocol/Inlemei Protocol). 

20 The World Wide Web ("WWW", or simply "Web - ) is n set of standards and protocols that 
enable the exchange of information between computers, more panictihuly rrypcrtcxt (HTTP) 
servers, on the Internet, rying thnn together into a vast collection of interactive multimedia 
resources. Traffic across the Internet passes through multiple "backbone" carriers, such as 
UUNet or Digex, which forward the signal* to one another under a system called peering. 

25 

"Quality of Service" (QoS) is a measure of Ihc speed :>nd reliability wlih which information 
can be transported from a content source (typically a server, ot an Internet Service Provider, 
ISP) to a user's computer (upon which is typically running a type of "browser" software tliat 
is used to view content). "Bandwidth" is a measurement of Ilie volume of informal ion that 
?0 can be transmitted over a network (or a given portion thereof) ot a given lime. The bigbct die 
bandwidth, the more data can be transported. Latency is a measure of the time tliat it takes 
for a packet of data to traverse line Internet, from source lo user. 

Bandwidth can he incrcosod by instiuling new capitn) resources, but increasing bandwidth 
35 only indirectly addresses latency, and increasing bandwidth cannot iiccelcratc overloaded or 
slow origin servers. Other solutions arc needed to overcome these barriers. One solution, 
caching content, is disclosed at length in an article entitled "Network Caching Guide, 
Optimizing Web Content Delivery", by Michael A. Gouldt, March J 999, which is available 
online (via the Internet) at die following address: 
4a http^Avww. inktomi. wm/productsmet woiltAech_resourr^s/tecrv'cichingguide.pdf 

A text version of the article is available online at the following address: 
totp://ww.v.gc<>gle.conv r search?q^rc 

techjesomces/tech/cachjngg\)ide.pdf 

45 As noted ib the Upukfc article: 

"One way to improve performance and make it more predictable is lo 
minimize the variability introduced by long trips ncross the Internet from the 
content source lo the user's browser. By storing frequently accessed content nt 
50 a location closer to the user, a great deal of latency and unpredictable delay in 
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the Inleroet can be eliminated The technique for doing ibis is called aching. 
Caching is a means of storing conieni object from a Web server closer to the 
user, where they can be retrieved more quickly. ft* storehouse ot objects, 
including text pages, images, and other content, is called a Web cache 
j (pronounced cash)." (Goujdj^ page 2) 

Caching is an approach that improves Qo3 tor users while improving the 
economics for network service providers. Caching entails storing frequently 
accessed Web content closer to users thereby reducing the number of hops 
10 that must be traversed in order retrieve content thai resides on a remote site. 

■ In operation, requests tor content originate from a user's browser and arc first 
directed to the caching server. If the content is currently contained in the cache 
15 ond is up to date, the content is sent to the user without the browser 's request 

having to be sent on to the originating server. This can reduce the latency for 
the transfer of content and reduce tire amount of time it takes to traasfer the 
content to the user. 11 also reduces tho amount of data the service provider has 
to retrieve upstream from the Internet to fulfill requests, ( {fpuldc. page 8) 

20 

Caching is a concept that is well understood in computer hardware design. 
Modern microprocessors employ on-chip caches of high-speed memory 1o 
store frequently used instructions ami data locally instead of having In read 
them from slower memory. Computer* implement several additional levda of 

25 caching, including RAM cache aod even on-disk cache, all designed to reduce 

tlw latency fix reading instructions and data and speeding up the transfer of 
dala wilhio the system. The bask principle behind caching is lo store 
frequently accessed data in a locttion that con be accessed more quickly than 
it could from the da la's more permanent location. In the case of a system's 

3t> CPU,, the permanent location is the disk. On the Web, the permanent location 

is the origjn server somewhere out on the Internet. (Goulde . page 8-9) 

Network caching, while a newer concept, is hated on exactly the same 
principle and has Ihe some benefit of cost-effectively improving the user 
35 experience. (Goulde. page 3) 

"Caching provides direct benefits to the end user in terms of reduced latency 
of Web page download (the lime you have to wait before anything starts to 
happen) and faster download speeds (the time it takes for all the downloading 

4i> to Hnish). But caching also provides benefits to network service providers. By 

storing, or caching, Web content ihat users request from remote servers 
locally, fewer of these requests have to be sent out over the Internet to he 
fulfilled As a result, the access provider maintaining a cache hw its upstream 
bandwidth requirement reduced. This reduces the bandwidth the service 

4.1 provider has to purchase in order lo provide iis customers with a satisfactory 

Web experience. (Goulde. page 3) 

"Caching also provides several benefits in □ Web hosting environment in 
which an uecess provider boats Web sites for marry customers. It can reduce 
50 ihe impact of traffic spikes caused by high-interest content and also serve as 
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the basis for a variety of value-added services, such as providing additional 
capacity, guaranteed service levels, and replication services. f gojddc, page 3) 

"[Caching] network data makes it possible lo provide users with an optimal 
experience and predictable response limes. With typical cache hit rates, the 
user experience has u higher quality of service (QoS). Improved QoS provides 
significant benefits to ISPs, content providers, and corporate sites. Bcucr QoS 
resists in higher customer loyalty and retention. It helps create a stronger 
brand equity, both for the access provider and for content providers. Content 
providers who ensure lint their content is amply cached throughout Ihe 
network , close to users, will ultimately see more visitors accessing tbeir 
content. (Goulds, page 5) 

"Web browsers also implement a form of caching, storing recently accessed 
Web conlent on Ihe user's bard disk' and reading from that local cache of files 
instead of accessing ihe Jpterrret This works weD when the user hits the 
'Back" and "Forward" buttons in their browser during a session, hut it docs 
nothing if Ihe user is browsing a site for the first time. (Gouldc, page 9) 

•Neither the browser's cache nor a Web server's cache can address network 
performance issues. By placing a enche of Web conlent on ihe network 
between the user and ibe originating Web sites, the distance that commonly 
accessed content has to travel over the Internet is reduced, and users 
experience quicker response and foster performance. Network caching tnkes 
advantage of the fact that some content is accessed mere fretruenlly that odier 
content. By implementing a caching solution, network service providers can 
provide a better Web experience to Ihcir customers while reducing the total 
requirement for high- bandwidth connections to the Internet. fGooldc. page 9) 

"When a caching solution Is not in place, requests for content from a browser 
and the content delivered from the origin server must repeatedly traverse a 
complete trip from tbe jequestiog computer to ihe computer that lias the 
contcut. Tbe Web browser sends a request for a Uniform Resource Locator 
(URL) that refers to a specific Web page on a particular server on lb© Internee 
The request is routed to the server through tbe norma] TCP/IP network 
transport. The content requested from the Web server (also known as an HTTP 
server) may be a static HTML page with links to one or more additional iiles, 
including graphics. The content may also he a dynamically created page that is 
generated from a search engine, a database query, or a Web application. The 
HTTP server returns the requested content to the Web browser one Gle at a 
time. Even a dynamically created page often has static components that are 
combined with the dynamic content to create the final page. (Gouldc. page 
13) 

"When caching is used, frequently accessed conlent is stored close to user. 
It may be stored inside a firewall ou tbe user's corporate LAN, at the user's 
ISP, or nt some other Network Access Point (NAP) or Point of Presaxrc (POP) 
located closer to the user than Ihe majority of Web servers. If the requested 
content, or file, is located in the cache and is current, or fresh, that is 
considered a cache hit. The more frequently user requests can be served from 
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the* caches, the higher H« hit rate, the belter Ihc user 's perfonnancc, and the 
less duin the service provider bos to retrieve from (Ik Internet on behalf of the 
user. (Gouldc. page M) 

5 "Similar issues exist for FIT tile transfers, with nn FTP server handling each 

request for u Hie submitted by the FTP client application. Delays and 
bottlenecks can be u bigger problem with FTP because (he typical size of a file 
transferred by FTP is larger (hen a typical HTML fik. Streamiug audio and 
video are additiojial examples of an Internet application that can benefit from 

10 caching content close to end users. tOoolde . page 14) 

"Network caching can be applied to 'content delivered over many different 
protocols. Inese include HTTP, NNTP, FfP, RTSP, and others. All are 
cliaractcrizsd by having some proportion of static content and high utilization. . 
15 Cache server support for each protocol is, of course, required." (page 1 4) 

"Since (he cache must be kept fresh, there will still be traffic from the JSP out 
to the Internet, even if every bit of content requested by users is in a cache. 
Page freshness has to be assured, and new content must be downloaded But 

20 by using caching, bandwidth utilisation can be optimized. It is even possible to 

use a cache with dynamic content, sinoe even these pages have some static 
content thai can be served from a cache. Depending on the distribution of 
traffic awl (he scalability of (he cache, up to 40 percent of user HTTP requests 
uan be taken off the network ami served from a cache. This makes networks 

35 more efficient, enabling better performance to be offered at lower cost,' 

t£aiide>p«se 14) 

The Gould article discusses (he effective use of caches, load balancing, where to locate 
caches in the infrastnicrtirc, and designing cache- friendly web com en!. There is also 
30 mention of protocols which have been developed - for example, Web Cache Control Protocol 
( WCCP) (page 1 8). There is also discussion of appropriate use of the "expire" and "man- 
age" headers in the HTTP protocol. «ioulric . page 27). And, as expressly stated by Gould, 

"With good design even dynamically generated pages can benefit from 
35 caching. By separating the dynamic content of a page from (he static content, 

the statk content can be cached and the dynamic content retrieved and 
downloaded separately. (Goyjde, page 28; emphasis supplied) * 

It is therefore known to cache static data from Web sites using a dedicated computer's RAM 
•to and hard disk, so thai this computer can act as a proxy between (he WWW ("web") servers 
and ihcir users. Several requests Tor a given Web page can then be served at the cost of a 
single request to the original Web server. Internet Scrvico Providers (ISPs) commonly 
provide such a caching scrvico to their customers. J lowcver, this technique suffers Horn two 
main drawbacks: 

45 - it is not applied globally, and even users of the Web sites who have access to a cache 

have to deliberately activate this feature, nf which they are oflen not awaie; and 

- ii docs Dot apply to pages which change very often, as is the case with dynamic 
conlent (content which is generated on-thc-fly by the remote .server). 

50 There therefore exists a need for the caching of dynamic content, as well as static conlent. 
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Gfojsnry/Deflnitfonj 

Units* otJjerwise noted, or as way be cvidcnl fiom the context of their usage, any terras, 
abbreviations, acronyms or scientific symbols and nutations used herein arc to be given Ihcir 
5 ordinary mowing in the technical discipline to which the invention nK*1 nearly pertains. The 
following glossary of terms Is intended to tend ch»nry and consistency to the vnrious descriptions 
contained herein, as well as in any prior ax! documents which may be cited: 



to Cache Server A highly optimi2td. application that stores {requemry accessed contort 

at strategic aggregation points cjose lo the users requesting that content 
in order to reduce the impact of delays and network bottlenecks. 

G \KP Cache Array Kout i ng Protocol. A protocol for synchronizi ng 

)5 multiple cache servers. CARP maintains a shared namespace tbat maps 

any Web object's address (URL) to only one node in the array. 
Requests are routed to that node. 

Cookie The mart common meaning of "Cookie" on the Internet refers to a 

2ti piece of information sent by a Web Server to a Web Browser that the 

Browser software is expected to save and to send back lo the Server 
whenever the browser makes Additional requests from the Server. 
Depending on the type of Cookie used, and the Browser's settings, the 
Browser may accept or not accept the Cookie, and may save the 
25 Cookie for cither a short time or a long time. Cookies might contain 

information such as login or registration information, crime "shopping 
cart" information, user preferences, etc.. When a Server receives a 
request from a Browser that includes a Cookie, the Server is able to use 
the information stored in the Cookie. For example, the Server might 
30 customize what is sent back to the user, or keep a log of particular 

user's requests. Cookies are usually set to expire after a predetcnnincJ 
amount of time and are usually saved b memory uajil the Browser 
software »s dosed down, at which time they may be saved to disk if 
their "expire time" has not been reached. 

35 

Dynamic Content "Live" content which is updated on a regular basis. Examples of 
dynamic content might include a "current temperature" display on a 
weather web site, search results, or a "Current Top Headlines" item on 
a j icws web site. 

•10 

rfTTF Server A server that implements the HTTP protocol, enabling it to serve Web 

pages to client agents (browsers), lit IP Servers support interfaces so 
tbat Web pages can call external programs. Tbey alfo support 
encry ption mechanisms for securely exchanging information and 
45 nuthenlicntion and access control mechanisms lo control access to 

content 

ICP Internet Cache Protocol. A protocol for synchrorrmng multiple cache 

servers. Hocb time a cache server experiences a miss, it broadcasts 
S& messages to all peer nodes ask ing whether each has the content. The' 
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requesting server then must issue a request Tar the content and forward 
it on to the user. 

Proxy Server A proxy server dels as an intermediary between a use; ;md Ihe Internet 

5 so on enterprise eon ensure security njid administrative- control and also 

provide a caching service. A proxy server Is usually Associated with or 
part of a gateway server thai separates the enterprise network from the 
outside network or a firewall Ihot protects the enterprise network from 
outside intrusion. 

10 

Routers Routers are the devices that build a fully interconnected network out of 

a collection of poiul-Uv point links. Routers un the Inland exchange 
information pertaining to their focal section of the network, part ieulaxry 
how close they are topologically to local systems. They collectively 
1 5 build a map of how to get from any point in the Internet to any other. 

Packets arc routed based oo the exchanged mapping mtbrmorion, until 
the lust router connects dnccily to the target system. 

Static Content "Fixed" or lung-term unchanging component* nf web pages stored as . 

20 fiJes thnt ore cither never changed or arc changed onlv on an infrequent 

basis. 

Switches • High-speed network devices that typically sit on the periphery of the 

Internet. Switches differ fhmi routers in providing higher perfooniance 
25 at a lower price but with limited functionality. Typical switches can 

route traffic locally but aren't concerned with complexities of routing 
found in the high-speed Internet backbone, Switches play an important 
roJc in caching because Ibey are often used lo divert the caehcublc 
traffic to the caching system. 

JQ . 

HTML Hypertext Markup language, A specification based on Standard 

Generalized Markup Langwge (SGML) for tagging text so that it may 
be displayed in n user agent (browser) in a standard way. 

J J HIT? Hypertext Transmission Protocol. An appjicatioo-tevel protocol that 

runs on. lop of TCP/IP, which is the foundation for the World Wide 
Web. 

JP Internet Protocol. Ihe network layer for the TCP/IP protocol suite. It is 

40 a connectionless, best-effort, packenrwitching protocol. 

IP Address A 32-bit address defined by the Internet Protocol that is usually 

represented in decimul notation. It uniquely identifies each computer 
on the Internet. 



Protocol An agreed-upon set of technical rules by which computers exchange 

information. 

UK J . Uniform Resource Locator. The method by which Internet sites are 

addressed. It includes an access protocol and either an IP address or 
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DNS name. An example is hHp:/A*wv».donjaiRwni, 

Usend Short far User's Nei wor k. A collection of tens of i hounds of bid letin 

boards residing no the Internet. Each contains discussion groups called 
newsgroups dedicated to various lopivs. Messages arc posted ami 
responded to over the Network News Transfer Protocol (NNTP). 

Web Server See HTTP Server. 
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BRIEF DESCRIPTION OKTOE INVENTION 

An object of the invention 1o provide a technique foi retracing bandwidth usage of WWW 
servers and improving the QoS of WWW sites. 

5 

According ro the invention, a technique for caching objects haviog dynamic content on the 
Internet generally comprises disposing a cache in the Internet Fot storing uixl updating copies 
of dynamic content The cache may be disposed at n location selected from the group 
consisting of major Internet switching locations, dinl-in aggregation points, and corporate 
1 o gateways- The cache mny a] so store and update copies of static content. 

According to o feature of the invention, update characteristics of the objects we determined, 
and a tin>e to live (TTL) parameter for the objects is adjusted based upon the updntc 
characteristics. UeneraJly, the object is updated if its TrL is less than its n»j:. The TTL for 
) 5 an object may be adjusted to: 

(i) maintain its probability of error below a predetermined error probability threshold; 

(ii) maintain its error rate bciow a predetermincil error probability threshold; or 
<ut) maintain its delay time below a predetermined delay threshold. 

20 According to the invention, a method df responding to a user request for an object having 
dynamic content, said object originating from a server, comprises storing a copy of lae object 
in a cache; establishing a lime to live (TTL) for Ibe object; receiving the user request at the 
cache; 

fulfilling the user request with the stored copy of the object if its TTL is greater than it* age; 
25 and fetching an updated copy of the object and responding to the user request with the 
updated copy if the TTL of (lie stored copy is less than its age. 

According to a feature of the invention, the TTL for the object U first set to o reasonable 
lower limit (Tmin) and is then adjusted based on tbe frequency at which the object actually 
30 changes. 

According to a feature of the invention, each time the cache fetches tike object from the 
server, the cache performs the following procedures: 

a. if another fetch for the same object is ongoing, waiting for the previous fetch to 
3* complete; 

h. fetching the object Emm the server, 

c. replacing the cached copy, if present, by the fetched object, after having coropared 
thero to determine whether the object hod changed since it was last felclied; 

d. initialrztng or updating the object's change statistics accordingly; 

40 e. marking the object ns static or dynamic cooleot depending on the server's reply; and 

f. if the object u dynamic, setting its TTL (T) to an appropriate value with respect to 
an average time between changes (t) determined from the object". < change statistics, the 
number of user requests per lime unit (rfj determined from the objects access statistics, and 
one of the following procedures (A-E): 

4> A. maximum error probability; 

B. maximum error rate; 

C. maximum delay; 

P. any combination of the above (A.B.C), talcing the lowest T; or 
£. any combination of the above (A,B,C), but keeping T within a predetermined 
50 window ofrcatonable" values bounded by Tmin and 7max. 
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Preferably, the caches are dedicated machines and are placed so thai Web browsing passes 
through Ihe cache instead of going all the way (o the original sites, in many different 
locations, ideally within (he network of ISPs providing the Internet connectivity to Ihe highest 
5 number of users tn those locations. In this mnnncn 

- the users of those ISPs and. to n lessor extort, neighboring ISPs, will crrjuy a huge 
QoS and speed improvement, for most of the traffic wil] stay within or clow to Ihe JSIY 
internal networks and not need to go through Ihe highly-loaded Internet backbone; and 

- the original wtb-o'tes will no longer need as much bandwidth, since the caches will 
1 0 absorb most of the load. 

Since many wcb-*ites tend to serve ail- dynamic contest, dynamic content will be cached, as 
well as static content. However, H is not merely a matter of the caches remembering the 
com cm, as there is no iniUcation as to when the latter is going to change, Hence, it needs to 
15 be reloaded, peribcficaJly, instead of simply serving the cached copy. 

Therefore, the cache reloads a page (fetches an object from the server), whenever its 
corresponding cached copy has not been refreshed for a given time (time to live, "TJX"). 
TTL can be thought of as the "shelf life" of the page The coche first, sets the TTL for a 

20 dynamic object to a reasonable lower limit T-j» Then, over time, as the cache reloads the 
page several times, it can track the frequency at which die page actoaWy changes content, and 
adjust (he TTL for that page accordijigly. This "TO/ teclwiquc mimics a common caching 
method for Static pages, if the original server of the page /specifics a TTL, But since servers 
for dynamic pages do not specify a TTL for them, the cache has lo establish a reasonable 

25 TTL c-f its own accord. 

"Iherefore: 

• whole web- sites can he cached, including their dynamic content ; 

• traffic between caches and the original sites is better managed; 

30 - while some users might gel an outdated content, in the ease of a page changing 

before the cache reloads it, it is possible to limit the delay between the two (which could 
happen anryMuy witliout the cache, when the network gets salivated) as well as Ihe error 
probability; 

- the system can adapt, in real time, nccordiug lo the number of requests to each page 
35 and the actual update frequency of the page. 

Other objects, features and advantages of the invention will become apparent in light of the 
following description thereof. 
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BRIE? DESCR IPTION OF 




Reference wilt be ctiikJc in dtlid) to preferred embodiments of I he invention, examples of which 
may be illustrated in die accompanying drawing figures. The figure* lire intended to be 
illustrative, not limiting. Although »bc invention is generally described m 0k context of these 
5 preferred embodiments, ii should be understood lhal it is not intended to limit the spirit and 
scope of the iuvcntkm to these particular embodiments 

In flowcharts presented herein, reclancitliur boxes generally represent a sequential step being 
performed, a generally rectangular bo* with pointed ends represents a decision step (test) 
10 having two mntiwlly-exchisive results ("Y"»Yes; "N^-No), und an empty circle is not a step or 
a test, but is merely a graphical junction point at which two or more paths in ti>e flowchart 
converge. 

The structure, operation, and advantages of the present preferred embodiment of the invention 
IS will become further apparent upon consideration of lire following description, taken in 
conjunction with the Accompanying figures, wherein: 

Flgitrr 1 is a greatly simplified schematic ilhwtration of the mternet, Uhtstwting an 
embodiment of die caching system of the present invention; 

20 

Fignre 2 i* a flowchart illus-lrattng hew user requests for static and/or dynamic content are 
handled by the cache, according to the invention; 

Figure 3 is a graph illustrating an average error probability, according to an analysis of the 
25 invention; and 

Figure 4 is a graph illustrating the evolution of error probability, according to an analysis of 
the invention. 
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DKTA11 JJ> DESCRIPTION OF THF INVENTION 

Figure 1 is a greatly simplified schematic ilfusrrotion of the Internet environment, illustrating 
, an embodiment of the caching system of the present invenlion. Generally, a user 102 makes 

5 a "request" for an "object 1 * (e.g., a Web page) which is made available on the Internet by a 
server (e.g^ ISP) .104. (Tlie object typically originates at a conlent provider, not shown.) A 
switch 106 interfaces a cache (cache server) 108 to the Internet. The cache may contab a 
copy of the Web page There me two possible * responses" to the user request - either the 
server "serves'' (or "services'") the request, or it is "fulfilled" in the cache. In the latter case, 

10 the content must first have been "transferred" to fbo cache, which may periodically "fetch" (or 
"reload") updated Web page content from the server. These I amis will be adhered to, to the 
extent possible, in the discussion that follows. 

As discussed in the Goultte article (eg., page 18, Illustration 4), the switch 106 may be a 
)&. high-performance processor that can look ot network traffic nod make routing decisions based 
on protocols above the I? level. M a result, the switch can direct HTTP (and oi her) traffic to 
caches (108), and send the rest of the traffic directly to the Internet. This is exemplary of 
one of a number of possible system architectures. Which architecture is used depends on 
several factors, including where the cache is implemented, the primary purpose of the cache, 
20 and the nature of the traffic. 

As further discussed in the Gouldc article: 

"Gicbes can other be deployed in a transparent or ooritransparent form. A 
25 nonlransparent cache is explicitly visible, nml browsers or other caches that 

use the cache arc overljy configured to direct traffic to the cache. In this case, 
the enche acts as a proxy agent for ihe browser, fulfilling requests when 
possible and forwarding requests to the origin server when necessary. 
Ncmtwnsparent caches are often a component of a larger proxy server acting 
30 as pari of a gateway or firewall and addressing many different applications. 

fOonldc . pages 1 6- 1 71 

A transparent cache shs in the network flow and functions invisibly to a 
browser. For ISP and enterprise backbone operations, u transparent 
3J configuration is preferred because it minimizes the total administrative and 

support burden of supporting users in configuring their browsers to find the 
cache, (Gouldc. page 1 7) 

Caches should be implemented transpuicntly to maximize the benefits of 
40 caching. A nonbonsparvnt implementation requires having browsers manually 

configured to direct their requests for content to the cache server. In a 
transparent configuration, cache benefits are delivered to clients without 
having to reconfigure the browser. Users automatically gain the benefits of 
caching. (Gouldc . page 17) 

45 

In an enterprise environment, transparency can be implemented cither through 
automatic browser configuration or by intercepting traffic on the net work- 
Both Netscape and Microsoft provide utilities for centrally con figuring large 
networks of browsers and for customizing browser eonfigiiratiorts being 
50 installed on users' PCs- Browser plug-ins can also provide automatic 
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configuration. Although this approach is transparent to Ibc user, il {Iocs 
require admuiislriitive effort on an ongoing basis. (Gptrlde. pnge 17) 

There arc several options for providiog transparent caching: 
5 - The cache can be configured as if it were a router so that all Internet-based 

traffic is aimed at it. This is a transparent configuration that requires no 
configuration of the browser; the browser or do*vnstream cache is unaware of 
the cache's existence but still benefits from il. The downside is that the system 
on which U« cache resides has to devote some of its resources to routing, end 

jo the cache becomes a mrssior^critical part of the network. Sophisticated router 

configuration witb policy-based routing can minimize some of these issues by 
only directing HTTP (TCP Port 80) traffic to the cache, bypassing the cache in 
the event of failure and sending traffic- directly to the Internet. (Gouhjc . pnge 
17) ~ 

15 - An increasingly popular option is to rrse a Layer 4 switch to interface the 

caclie to the Internet (sec II lustration 4), 'Jhese switches, currently offered by 
Alteon, Foundry, ArrowPoint, untl others, are high-performance processors 
that can look at network traffic and make routing decisions based on protocols 
above tin: JP JeveL As a result, the switch can direct HTTP (and other) miilc 

20 to the caches and send the rest of the traffic directly to the hit erne t ... [Tjhc 

switch can parse the HTTP Teeniest and send the request to a specific node in a 
cache form based on the URL requested. Using an intelligent switch keeps 
unnecessary network traffic off the cache, simplifies designing for availability, 
and distributes loading on the cache farm based on specific URLs, (Goul ds, 

2S page 18) (An architecture similar to this one is described hereinabove with 

respect to Mgnre 1) 

- Another option for transparency is ibe Web Cache Control Protocol 
(WCCP). WCCP was developed by Cisco Systems to aUow Web caches to be 
transparently installed in networks using Cisco lOS-based routers. With 

30 WCCP, HTTP traffic is redirected to the Weh cache instead of the origin 

server. WCCP does not require any changes to the network architecture, 
thereby simplifying the iroplementutiuo of transparent Web caching. (Ooulde. 
page IS) 

- Web Proxy AutodisccA'cry Protocol (WPAD) is a new proposed standard 
35 protocol, which, when integrated with browsers, streaming media clients, and 

other Internet client software, is designed to automatically locate caches and 
services on Ihe network without requiring any configuration by end users. 
WPAD provides a flexible, vernlcr-nciilm! soflwure nlternntive to existing 
cache harwpaj-ency solutions that utilize routing or switching equipment In 

49 the fpUrrc, WPAD -enabled client software will automatically connect users 
with embedded network sci Vices in their rcgiun, providing simplicity for botb 
users and the network providers that deploy these services. fOoulde . pages 
18-19) • 

43 Caching systems can be used to optimize the performance of a Web server site 

US well as to speed Internet access for Web browser users, tn a reverse proxy 
configuration, the caching system sits in fronl of one or more Web servers, 
intercepting traffic to those servers and standing in. or proxy ing, for one or 
more of the servers. Cache .<<crven; can be deployed throughout a network, 

50 creating a distributed network for hosted content. When necessary the proxy 
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cache server will request <Jynwuic and other short-lived content from the 
origin servers. Thw enables content from ihc Rite to be saved from a local 
cache instead of from tivs origin server. The proxy server con be optimized for 
high pesforrmmee, efficient operation, conserving resources, owl off-kanling 
5 the origin server from serving static content. Reverse proxy caching provides 

benefit* lo tbc access provider as well as to the user. Those benefits ioehide 
the ability to enable load balancing, provide peak-dcroand insurance to assure 
availability, and provide dynamic mirroring of content for high availability. 
< Gouldc. page 20) 

10 

There are three general characteristics that describe where caches axe best 
located on n network: 

- Choke Points in the Network Traffic. There are locations where a lovge 
majority of network traffic posses and is therefore visible lo ibe cache scrvci. 

15 ThU allows the enche to handle more requevts and store more content tl>an if 

located somewhere that can be easily bypassed 

- Points with High-Network Load. High traffic conditions nllow higher 
cache utilization nnd therefore greater bent tils can be achieved. 

. Locations that Produce Greatest economic Benefits for a C»cbe. Points 
2D where users will benefit from high cache hit rotes while also reducing 

upstream bandwidth requirements will provide both QoS benefits and 
economics ("or tbc access provider, locations with these cliaractcristics are 
typically found at major Internet switching locations, diol-ra aggregation 
points, or corporate gateways. (Qpulde, page 20) 

25 

Locations with these characteristics are typically found at major Internet 
switching locations, dinMo aggregation points, or corporate gateways, 
including: 

POP aod DIAL-UP (sec Gould c. page 21, Illustration 7) 
30 NAPS (see Gnu Me. page 22. Illustration &) 

Satellite (sec Gouldc page 22) 

International Gateway (see Gouldc . page 23, Illustration 9) 
Web hosting (see QwlQe, page 24, Illustration JO) 
Last Mile (see Gouldc page 25, Ulustmtion 1 1) 
33 Corporate Gateways (sec Coulde. page 25, Illustration 1 2) 

A person having ordinary skill in the art will readily understand where to locate the c»cbc(s) 
of (he present invention ip light of the description set forth hereinabove. Generally speaking, 
the cache of the present invention can be located anywhere that there is (or could be) a cache 
40 saving stotio content, or it can be incorporated into an existing cacl)e which fulfills requests 
for static content, with the additional functionality enabled according to the techniques set 
forth below. Or, it coo be uiwided as a separate, dedicated machine (computer). 

Figure 3 is a flowchart illustrating how user requests for static and/or dynamic content are 
45 handled by the cache. 

In a first step 202, for a user request fox an object, the cache determines whether the 
requested object is in cache. If not (N), the user request is passed on lo the server for 
servicing the request end meanwhile, in a step 204, the cache fetches the object from die 
5t? server in anticipation of the next request fur the object from the same or another user. 
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Generally, in this example, rtll requests for objects ore presumed to go through a cache server, 
which is transparent to the user. It intercepts information requests and decides whether it will 
provide a response from s cached local copy or from a remote information source. After 
J fetching information from a local source, ibe cocho server decides whether to store it locally, 
and if ro, for how long. A request for information which can be provided from a local copy 
is known as a "cache hit". Conversely, a request for information which is not Stored locally is 
known a "cache mije". When the storage determination algorithm is well-designed the 
protmbrlily of a cache hit is greatly improved, and apparent response time to user requests 
10 (related to QoS) is reduced. Farther, every Inl'orrnation request satisfied by locally cached 
content (cache hh) reduces traffic on the external network, pcrreiUiny shorter response times 
over the external network. 

If the requested object is in the cache, it is next determined in a step 20$ whether the 
15 requested object is marked as static. If so (Y), it is iben ilclcrmincd in a &(ep 208 whelher to 
updaie ihe cached copy or to use it to fulfill the user request using any suitable standard 
algorithm for caching static objects, such as comparing Hie objects *>gc" (the time elapsed 
since it has last been refreshed) to the TTL (if the original server of the page specifies a 
TTL), asking Die server the latest modification time, etc). 

39 

If too requested object is in cache, and it is dynamic (N, step 206), it is determined in a step 
210 whether the cached copy's TTL ("shelf life") is less than its age. 

If the cached copy's TTl. is Jess than (a lower number than) its age (Y, step 21 OX it is 
25 considered ro be "stale-, and in a step 2 1 2 the cache: 

- updates the object's access statistics (number of user requests per time 
period - liLSt few minutes, last hour, etc.); and 

- fetches the object from the original server. 

30 It the cached copy's TIT is equal to or greater than its age (N, step 2 1 0), it is considered to be 
"frcsh" ; and in a step 21 4 the cache: 

■• fat/iNs the request using the caclted copy; und 

- updates the ohjeefs access statistics. 

35 Optionally, if the time difference between the cached copy's age and its TTL is less than a 
given time, and the number of recent user requests i3 more than a given rale, it is considered 
to be "u^ed" anil "popular", and the cache fetches the object from Ihe server in what is termed 
an 'anticipated refresh". 

40 Additionally, for each time Ibe cache fetches an object from the server (see steps 204 and 
212), the following procedures are performed by the cache, in a step 216: 

a, if another fetch for the same object is ongoing (e.g.. due to a previous user request), 
the cache warts for Ihe previous fetch to complete, rather than duplicating it request; 

b. fetches the object from the scrver, 

<5 c. replaces its cached copy, if present, by the op-to-daie object, after having compared 

them to determine whether the object had clianged since it was lasl fetched; 

d. initialize or updute the object's. change statistics (number of changes per time 
period) accordingly; 

e. mark the object as static or dynamic content depending on the original server's 
50 reply (presence of o modification lime, or of a oprttero TTL, etc.); ■ 
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f. if the object is dynamic set its TTL (7) to the appropriate value with respect to the 
average time between changes x (dtieitruncd from the object's change statistics), the number 
of user requests per time unit rj (determined from the objects access statistics), and a selected 
one of the following procedures (A-E): 

A, maximum error probability ( po > which is the average )id\t> of the number of 
requests fulfilled using a cached copy who* cwesponding w»mJ object has clumgcd for 
more than a given time W, over the total number of requests 




g. maxirmrrn error rate ( «o ), which is the overage number per time unit of requests 
fulfilled using a cached copy whose corresponding original object has changed for mote than 
a given time }Y: 







i 




w 
















v n J 





20 

maximum delay ( D 9 \ wliich is the average time between an object change and 
when the cached copy is refreshed: 

- 6A, 



D, say coraburation of the above (A3.C), taking the lowest T, ox 

Ej any combination of the above (A,B,C) V but keeping T within a predetermined 
window of •reasonable" values bounded by /rain and /max. 

In the above equations, the following parameters have been used: 
T is Time To 1 jve for the dynamic object; 

IV is a given linic since the original object Ihls changed (i.e., bow long it is outdated); 
x is on average time bei>veen changes, which is determined from the object's change 
statistics; 

n is number of user requests per time unit (e.g., frequency); 

po is maximum error probability, which is the overage ratio (if the number of requests 
fulfilled using a cached copy whose corresponding original object lias changed for 
more than the given time W, over the total number of requests; 
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rj p is maximum error rate, which is (he average number per lime unit of requests fiiliJ lied 
using a caclted copy whose' corresponding original object has changed for more than 
0k given lime W; 

Do j.» maxirmren delay, which is the overage time between an object chanyc nod when the 
5 cached copy is refrcsJied. 

Selecting uni Adjusting TTL 
A. simple model, which takes no other pommel er than the average Ihne between changes (i) 
of the object is the exponential decrease model - that is, the probability that the object docs 
to not change during a Umc period of length / li considered to be of the form c 4 1 \ It is, of 
course, I for / » 0, and taxis to 0 as t tends to + (co) 

Considering a situation with many user requests, as compared to ibe refresh rate of the cached 
copy, 50 the cache will update at regular intervals, despite being oscr-driyen. 

15 

The model's consistency is now checked, especially whether the average update frequency of 
a givco object is 1/ T. Let p',0) be the probability tltat * changes over a lime period of t. 
By construction: 



And. for k>0, p\ = [Mrift-f" w yi*~*»* - [plM^* 



(the probability that k changes occur over [0:1) is the sum over « of the 
probability that * - 1 changes ocvurred between 0 and «, that (here was a 
cliange between « and w+iIm, ami none bcrween «+d« and t\ 

One can dcttuce by recurrence that: 



i / t 



= (t!t>(t) 
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Now the overage number of changes over ti lime period ofi is (by definition of averaging): 



5 Htnce, ihc changes do h3vc an average frequency of 1/ x. 
Average error probability 

The error probability is defined as being the percentage of cases where Die user receives the 
cached copy wherens the object has changed on the original server, ;mj the cached copy is 

to outdated by more than a given lime W. Figure 3 ilrustrutes the percentage error (vertical 
axis, in perccRtagcs) versus time (horizontal axis, in seconds) for instance, with W-]5s, t - 
60s, and I"- 30s. The graph illustrates the probability that the cached copy is "state* by more 
than W seconds, in the graph, w is 15 seconds, so the probability is zero between I and 15 
seconds. Then, the probability rises until 30 seconds, when u re-feteh occurs (T- 30s), as 

15 illustrated buy the sawtooth pattern 302. This pattern repeats itself every 30 seconds. 
Between 30 and 45 seconds, the probability is again zero, and a re-fetch occurs at 45 seconds, 
and the probability rises unUl 60 seconds, as illustrated by the sawtooth pattern 304. A 
similar result is shown by the sawtooth pattern 306 between 75 and 90 seconds. 

20 Oivcn that the cache is supposed to update the copy of the object at regular intervals, the 
calculations can be performed over a time interval [0;Tj. 

The average time £ within the intexvnl during which the content is outdated U the sum over 
all intervals [tr, tr+d/i] of the probability that rhere was no change between 0 and u, bat that 
25 there was a change between u and «Hl«, multiplied by the length of time during which the 
content remains outdated by more than W. That is: 
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or, vkiih 



A' =r{a -! + «-*) 



From ibis, the error probability can be deduced as: 

yC;r rr~el-(a - Iff - ') 



Oasirr-lo-ruimllc bounds can be calculated by letting: jr/ fl ) *1 > 0 _ j + c - 
15 6 2 



Its derivatives are: F' (a) ■ — - — a+i~*~" 



F"(a)=)-e"* 

Sijtce F(0) = F*(0) = FtO) - 0, and F"(a) j.s positive for any a >= 9, rW ""J- 

ultimately, I : (U) ore os well. So: 
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There are better, eosy-to-find bounds for tbe case of u>=l , but lhat is noi .in interesting C3se 
(T-W>=x). Hence; 
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Mrounwl TTL to etisort a given error probability 

Aa upper bound should be set on TTL which still ensures that the error probability is below a 
given threshold p#. 



Pen Z pO c= ~(<r -l+O* p % 



T 2 y * 

{T-W) : , _ ._ 
i 2p*r 

r 

?W<T <<fV + r/ 
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Therefore, an acceptable TTL is: 



Maxima l TTL jo. ensure ft given, ttTOr nle 
10 Ibis is the same as for the enrr probability, replacing/^ by /r ft / n as: 



<=T5 







i 


rf 























25 Therefore, an acceptable TTL is: 







7 


4 


W 








ir + ri 











35 Average delay for a tavoi TTL . 
It is useful to consider the average delay T) between tFic cached copy and the original objeel 
in the case where the latter has chained during a given interval. Tint, cofcwlato <Ar(w>> 
which is the average delay knowing that the change occurred at a time « between 0 and T. 



40 
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From this, ibc overall delay <At> can be tlcdueed by integniting <Af[uy> times the probability 
that a cbnn&e occurred between « and a t J o. over [0;7]: 



<6i> 



25 Since Ihc <At> calculation does not tnVe wlo account Ibat there has been n change within 
[O-.T], it should be divided by Ihe corresponding probability to get tk 



30 



(53) 



JP 2004-504681 A 2004. 2. 12 



PCT/I) J) 1/0065! 



andsince: fL_£_ + c 
6 2 

and: —-a + 1-*" * 0 



To enter to limit J> to a I'll sftoukJ be chosec, roch as: 

T T 
6 2t 



r sa 
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From Ihc above, the following tables of Tit (1* ($)) versus a uurober of crrw probabilities, 
Jbi various values of T and Wean be derived: 





,T = 30a 
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From lie above, ibe following conclusions can generally be made. 

10 llje probability of the retrieved infonwtion being "stale" (older than W) is essentially zero 
foj values of T(s) (Time to live) less Uwd or equal to W and increases with increasing Tfs) 
according to a decrying exponential, approaching 100% probability of error at infinity. This 
observation that tlw error probability is zero for values of T{s) less than or eq\t«vl to W is 
essentially * "trivia! " result, since it is clear Ihut no information can be older than W if it is 

15 updated more freiruently than W. Clearly, however, ihe average opdatc interval t has a 
significant elTect on bow steeply the eww probability climbs for vahies of T(s) greater than 
W. The greater the average update interval t with respect to W. ihe less sharply the error 
probability rises with increasing values of T(»). 

20 Figure 4 is a graph 400 illustrating the evolution of error probability p m (vertical axis, 
expressed as a percentage (%)), as a function of the TTL (in seconds (s)) fbi various values of 
x (W - 1 Ss). The line 402 is for t « 20s, the line 404 is for x = 60s. the line 406 is for T = 
300(5, and the line 408 is for x - 600s. 

23 A graphical Illustration of a technique for choosing o TfL.to maintain error probability 
below a threshold value ji^is obtained by identifying the value of g^oti the vertical axis of the 
graph 400 ond following an imaginary line horizontally across the graph 400 to where it 
intersects the cuive (402,404,406, or 40S) for tbc appropriate value of x. By way of example, 
if an error probability threshold value of 1 0% is chosen, then the TTL for x - 20* (lint 402) 

5ft is n Utile under 30 seconds, for x - 60s (line 404) it is about 35 seconds, for T = 300s (line 
406) it is shout 85 seconds, and for x - 600s (line 408) it is about 150 seconds. 
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By deploying caches for static and dynamic conimil at nppjopriate locations in the Interact, 
and by selecting appropriate update characteristics for cached dynamic content as described 
hereinabove, effective user response rinvts to both dynamic and static content can be reduced 
white simultaneously reducing congestion on the external network (i.e., the Internet). 

5 

The first time an end user (110) receives o web page, the web page moy optionally plant a 
cookie io lac user's browser, "InereaAcr, every time the end user accesses ibe web page, the 
browser sends the cookie along with Hie access request. The cookie Id la the server (104) 
what the server wants (or needs) to know about ihe end user, in addition to simpry the fact 
10 that the user wonts to retrieve tbc web page. For example, if ibe web page is a persona! iz*d 
web page of a single end user, the server knows the end user's preferences. 

According to u feature of the invention, the cucbe (108) can look tit the cookie and see rf the 
requested web page is a personalized page of a single end user, or if the cookie indicates the 
15 access request is coming fiora a global end user. If the requested web page is a personalized 
w«b page for a single end use* (ot if there is some rMher indication that the web page is 
supposed to be modified each time that it is accessed), then tlx web page should not be 
• cached This can be accomplished by setting TI L = 0. Else (e,g., gjabal end user), TTL is 
si mpry set as described hereinabove. 

70 

Although the invention has been described with respect to a limited number orcmbedimcnts, 
it will be appreciated (hat rainy variations, modifications aj)d other applications of the 
invention may be made, end are intended to be within the scope or the invention, as disclosed 
herein. 

25 
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CLAIMS 

Whal is claimed is: 

1 . System for caching objects laving dynamic content on the Internet, comprising: 

a cache connected in Ihc lateruvt for storing and updating copies of dynamic content. 

2. System. According to claim t, wherein: 

(he cache is disposed at a location selected from the group consisting of major Intermit 
switching locations, dial -in aggregation points, and corporate gateways. . 

3. System, according to claim I . wherein: 

the cache also stores and updates copies of static conir.nl. 

4. System, according io claim 1 , further comprising: 

means for monitoring oiie or more of the objects to determine update characteristics 
thereof; and 

means for adjusting a lime to live (TTL) parameter for the objects based upon t!tc 
update characteristics, 

5. System, according to claim 4, further comprising; 
means for determining an age of an object; and 

metuis for updating the object if ihe TTL. for the object is less than its age. 

C>. System, according to claim 5, wherein: 

Ihc TIL for n»ch objeel is calculated ncconling lo an average time of changes for the 

object. 

7. System, according to claim 4, further comprising: 

means for determining a probability of error for each of the objects; and 
means for adjusting the TTL for each of the objects to maintain i is probability of 
error below a predetermined error probability threshold. 

8. System, according to claim 7, wherein: 

the TTL for each object is calculated according lo the equation: 




wherein: 

T is the Time To Live, for the dynamic object; 

W is a given time since the original object has changed (i.c, how long it is 
outdated); 

T is an average time between changes, which is determined from the object's 
change statistics; and 

po is maximum error probability, which is the average ratio of the number of 

requests fulfilled using a cached copy whose corresponding original object has 
changed for more th*n the given lime fV, over ihc total number of rcqnests. 
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9. System, according to claim 4, further comprising: 

means for determining an error rote for each of the objects: and 
means for adjusting TfL for each of the objects lo mainruin its cjtot rale below n 
predetermined error prubabttity threshold. 

1 0. System according lo cjajro 9, wherein: 

the TTL for each object is calculated According to the equation: 



wherein; 

T is (he Time To Live for the dynamic object; 

W is a given time since the original object has changed 0 c, how long it is 
outdated); 

X is na overage time between changes, which Is determined from the object'* 
change statistics; 

n is number of user requests per tinxs unit (e.g., frequency); and 

rt» is maximum error rote, which is the average number per time unit of requests 

fulfilled using a cached copy whose corresponding original object has changed 

for more than the given time W. 

\ 1. System, according to claim 4, further comprising: 

means for determining a delay time for each of the objects; and 

means Tor adjusting TTL for each of the objects to maintain its dejgy time below a 

predetermined delay threshold. 

1 2. System according lo claim 1 1 , wherein : 

the TT1. for each object is calculated according to the equation: 



wherein: 

T is the Tunc To Live for the dynamic object; 

i is an average time between changes, which is determined from the objeefs 
change statistics; and 

1>» is maximum delay, which is the average time between an object change and 
when the cached copy is refreshed. 

13. Sysiem, according to claim A, further comprising: 

means fot determining at least one object characteristic selected from the group 
consisting of error probability, error rate and delay time for each of the objects; and 

means for adjusting TIL lo maintain the selected object chwwterties below a 
respective threshold value. 
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14. System, according to claim 1 3, further comprising: 

means for limiting adjustment of TTL for each of the objects to a range bounded by 
predetermined minimum (Tmin) raid mnximum (7max)vsliits for 'ITU 

15. Method of responding to a user request for information liaviug dynamic content, 
comprising: 

storing a copy of the dynamic content in a cache; 
establishing a time m live (TTL) for the dynamic content; 
receiving the user request at the cache; 

responding to the user request with the stored copy of the dynamic content if its TI L 
is greater than its age; .ind 

retrieving on updated copy of the dynamic content and responding to the user request 
with Ihe updated copy if Ihe TTL of the stored copy is less than its age. 

1 6. Method, according to claim 1 5, further comprising: 
storing a copy of static content in ibe cache. 

17. • Method, according to claim 15, further comprising: 

determining an average update frequency for the dynamic content; and 
determining the TTL for the dynamic content as a function of its overage update 
frequency. 

1 8. Method, according to claim 1 5, further comprising: 

del ermining an average update frequency for the dynamic content; and 
determining the TTL for the dynamic content as a function of its average update 
frequency and a predetermined error probability threshold. 

19. Method, according to cJaim 18, fwi)w comprising: 

adjusting the TTL for the dynamic content according to a frequency of usct requests 
for Uw dynamic content. 

20. Method, according to claim 1 5, wherein r 

the TTL for each object is calculated according to the equation: 




whcicin: 

T is the Time To Live, for the dynamic object; 

W is a given time since the original object has changed (i.e., bo* long it is 
outdated), 

T is on average time between changes, which is tletennined from the object's 
change statistics; nnd 

is maximum error probability, which is the average ratio of the number of 
requests fulfilled using a cached copy whose corresponding original object has 
changed For more than the given liroe iV, ovct Ihe total number of requests. 



21. 



Method, according to claim 15, wherein: 
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the TIL for each object it calculated according to Ihe equation: 









w 


i - 


W + T — *■ 



wherein: 

r is the Time To Live far the dynamic object; 

If is n given time since the original objecl has changed (i.e., how long it is 
ootduted); 

t is an average rime between changes, which is determined from the object's 
change statistics; 

n a rrombcr of user requests per lime unit (e.g., frequency); and 

is maximum error nite, which is the average number per time unit of requests 
fulfilled using o cached copy whose corresponding original object has changed 
for more than the given time >K 



no 



Method, according to claim 1 5, wherein: 

(he TTl, for each object is calculated according to the equation: 



1+3 



wherein: 

T is (he Time To Li ve for the dynamic object; 

t is an average lime between chnngts, which is determined from the object's 

. change statistics; and 
Do is maximum delay, which is the average time between ait object clvange and 
when the cached copy is refreshed. 

23. Method, according to claim 1 5, wherein the information is represented as n web page, 
I be method further comprising: 

a 15 rat time the user receives the web page, the wtb page plants a cookie in the user's 



24. Method, according to claim 23, wherein: 

every subsequent tine the end user requests the web page, the browser sends the 

cookie. 



25. Method, according to claim 24, further comprising: 

the cache looks at the cookie and sec if the requested web page is a personalized page 
of a single end user. 

26. Method, according to drum 23. further wimprising: 
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if the requested web page Is a personalized web page for a single end user, then the 
web page is not cached 

27. Method, nccor ding to claim \ 5, further comprising: 

if (be information is supposed to be modified each time it is accessed, setting TTL ~ 

0. 



28. Method of responding to a user request for ua object buvtog dynamic content, said 
object originating from a server, comprising: 

storing a copy of the object in a cache; 
establishing a time to live (TTL) for the object; 
receiving the user request at the cache; 

fulfilling the user request with the stored copy of the object if its TTL is greater than 
its nge; and 

fetching an updated copy of the object and responding to the user request with the 
updated copy if the TTL of the slorcd copy is less than its age- 

29. Method, according to claim 28. further comprising, in the cache: 

first setting the Til foe the object to a rcasooahle lower limit (Train); and 
adjusting the TTL for the object based on the frequency rtt which the object actually 
changes. 

30. Method, according to claim 28, further comprising: 

each time the cache fetches the object from the server, performing the following 
procedures: 

a. if another fetch for the .%ame object is ongoing, waiting for the previous fetch to 
complete; 

b. fetching the object from Ihe server, 

c replacing Ihe cached copy, if present, by the fetched object, aflcr having compared 
them to determine whether the object had changed since it was last fetched; 
d. initializing or updating the object's change statistics accordingly; 

c. marking the object as static or dynamic content depending on the server's reply; and 
f. if the object is dynamic, setting its TTL (7) to an appropriate value with respect to 

an average lime between changes (i) determined from the object's change statistics, ihe 
number of user requests per time unit (n) determined from the objccls access statistics, and 
one o f the following procedures (A -E): 

A. maximum error probability; 

U. maximum error rate; 
' C. UNUUflnvD delay; 

D. ww combination of the above (A r B,C), raking ihe lowest T\ or 
R any combination of the above (A,D,C), bot keeping T within a predetermined 
window of "reasonable" values bounded by Train and Tmax. 

31 . Mclhod, accoiding to claim 28, wherein the information is represented as a web page, 
the method farther comprising: 

a first tiujc the user receives the web page, the web page plants a cookie in the user's 
browser. 

32. Method, accoiding to claim 3 1 , wherein: 
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every subsequent Uroe the end user requests the web page, the browser scads the 

cookie. 

33. Method, according to claim 32, furll** comprising: 

the cache looks at the cookie nnd sec if the rarucsted web page is n personalized page 
of n single end nser. 

34. Method, according lo claim 3 J , further comprising: 

if the requested web page In a personalised web page for a single end user, then the 
web page is not cached. 

35. Method, according to clai m 2 8, further comprising: 

if the information is supposed to be modified each time it is Accessed, selling TfL ~ 0. 
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